Concurrent Warehouse Maintenance Without Compromising Session Consistency
نویسندگان
چکیده
To achieve acceptable query response times in data warehouse environments, the data have to be pre-aggregated according to the need of the applications and stored redundantly. These materialized views must be kept up to date, to assure their usability. In this paper we present an algorithm which improves the maintenance of materialized views in two ways. In the first place, the maintenance process is accelerated by supporting different maintenance strategies for individual views. Thus, views which do not need to be up to date can be maintained in a deferred or periodical way. In the second place, the algorithm provides session consistency, assuring that during an OLAP-session the underlying data do not change from the user’s point of view. The algorithm facilitates high flexibility and performance of view maintenance without compromising session consistency.
منابع مشابه
An Architecture of a Data
We present incremental view maintenance algorithms for a data warehouse derived from multiple distributed autonomous data sources. We begin with a detailed framework for analyzing view maintenance algorithms for multiple data sources with concurrent updates. Earlier approaches for view maintenance in the presence of concurrent updates typically require two types of messages: one to compute the ...
متن کاملConcurrent View Maintenance Scheme for Soft Real-time Data Warehouse Systems
The data warehouse should be refreshed periodically or aperiodically in order to prevent OLAP transactions from reading extremely stale data. This refreshing process is referred to as view maintenance. There have been many researches on ways to preserve data consistency during the process of view maintenance. The simplest approach among them is to perform view maintenance in the nighttime while...
متن کاملConsistency in a Stream Warehouse
A stream warehouse is a Data Stream Management System (DSMS) that stores a very long history, e.g. years or decades; or equivalently a data warehouse that is continuously loaded. A stream warehouse enables queries that seamlessly range from realtime alerting and diagnostics to long-term data mining. However, continuously loading data from uncontrolled sources into a realtime stream warehouse in...
متن کاملExploiting Versions for On-line Data Warehouse Maintenance in MOLAP Servers
A data warehouse is an integrated database whose data is collected from several data sources, and supports on-line analytical processing (OLAP). Typically, a query to the data warehouse tends to be complex and involves a large volume of data. To keep the data at the warehouse consistent with the source data, changes to the data sources should be propagated to the data warehouse periodically. Be...
متن کاملIncremental ETL Pipeline Scheduling for Near Real-Time Data Warehouses
We present our work based on an incremental ETL pipeline for on-demand data warehouse maintenance. Pipeline parallelism is exploited to concurrently execute a chain of maintenance jobs, each of which takes a batch of delta tuples extracted from source-local transactions with commit timestamps preceding the arrival time of an incoming warehouse query and calculates Ąnal deltas to bring relevant ...
متن کامل